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Abstract. We estimate the relative importance of small and large earthquakes for static 
stress changes and for earthquake triggering, assuming that earthquakes are triggered 
by static stress changes and that earthquakes are located on a fractal network of dimen- 
sion D. This model predicts that both the number of events triggered by an earthquake 
of magnitude m and the stress change induced by this earthquake at the location of other 
earthquakes increase with m as ~ 10 Dm / 2 . The stronger the spatial clustering, the larger 
the influence of small earthquakes on stress changes at the location of a future event as 
well as earthquake triggering. If earthquake magnitudes follow the Gutenberg-Richter law 
with b > D/2, small earthquakes collectively dominate stress transfer and earthquake 
triggering, because their greater frequency overcomes their smaller individual triggering 
potential. Using a Southern California catalog, we observe that the rate of seismicity trig- 
gered by an earthquake of magnitude m increases with m as 10°™, where a = 1.05 ± 
0.05. We also find that the magnitude distribution of triggered earthquakes is indepen- 
dent of the triggering earthquake's magnitude m. When a « b, small earthquakes are 
roughly as important to earthquake triggering as larger ones. We evaluate the fractal cor- 
relation dimension D of hypocenters using two relocated catalogs for Southern Califor- 
nia. The value of D measured for distances 0.1 < r < 5 km is D = 1.54 for the Shearer 
et al. [2003] catalog and D = 1.73 for the Hauksson et al. [2003] catalog. The value of 
D reflects both the structure of the fault network and the nature of earthquake inter- 
actions. By considering only those earthquake pairs with inter-event times larger than 
1000 days, we can largely remove the effects of short-term clustering. Then D m 2, close 
to the value D = 2a — 2.1 predicted by assuming that earthquake triggering is due 
to static stress. The value D m 26 implies that small earthquakes are as important as 
larger ones for stress transfers between earthquakes, and that considering stress changes 
induced by small earthquakes should improve models of earthquake interactions. 



1. Introduction 

Large shallow earthquakes are followed by increased seis- 
mic activity known as "aftershocks". Aftershock sequences 
of small earthquakes are less obvious because the aftershock 
productivity is weaker, but can be observed after stack- 
ing many sequences [Helmstetter, 2003]. Several mecha- 
nisms have been proposed to explain earthquake triggering 
due to the static stress change induced by a prior event: 
rate-and-state friction [Dieterich, 1994], sub-critical crack 
growth [Das and Scholz, 1981; Shaw, 1993], viscous relax- 
ation [Mikumo and Miyatake, 1979], static fatigue [Scholz, 
1968], pore fluid flow [Nur and Booker, 1972], or simple sand- 
pile SOC models [Hergarten and Neugebauer, 2002]. Ka- 
gan and Knopoff [1987a] proposed that random, Brownian 
motion-like stress fluctuations cause Omori-law aftershock 
rate decay. Coulomb stress change calculations have been 
used to predict the locations, focal mechanisms and times 
of future earthquakes (see reviews by Harris [1998], Stein 
[1999] and King and Cocco [2001]). Because large earth- 
quakes modify stress over a much larger area than smaller 
ones, and because computing Coulomb stress changes re- 
quires a good model of slip distribution available only for 
large earthquakes, most studies have neglected the influence 



Copyright 2008 by the American Geophysical Union. 
0148-0227/08/$9.00 



of "small" earthquakes. Most researchers include only the 
mainshock and its largest foreshock or aftershock to pre- 
dict the location of future aftershocks. The success of this 
approach is significant but limited. Only about 60% of after- 
shocks are located where the stress increased after a main- 
shock [Parsons, 2002]; stress shadows (proposed decrease of 
the seismicity rate where Coulomb stress change is negative) 
are seldom or never observed [Marsan, 2003; Felzer et al., 
2003b]; and the correlation of Coulomb stress change with 
aftershock locations is rather sensitive to the assumed slip 
distribution [Steacy et al., 2004]. Coulomb stress change 
from the Mw 7.3 Landers earthquake does not explain the 
triggering of the Mw 7.1 Hector-Mine earthquake [Harris 
and Simpson, 2002] which occurred 7 years after Landers 
and 20 km away. Several alternative models have been pro- 
posed to explain the triggering of the Hector-Mine earth- 
quake: viscoelastic effects [Zeng, 2001], dynamic triggering 
[Kilb, 2003], or secondary aftershocks [Felzer et al., 2002]. 
Felzer et al. [2002] suggested that Hector-Mine may have 
been triggered indirectly by an aftershock of Landers, i.e., 
secondary aftershocks and small earthquakes may be impor- 
tant for stress triggering. 

A few studies have estimated the scaling of the after- 
shock number with the mainshock magnitude [Solov'ev and 
Solov'eva, 1962; Papazachos et al, 1967; Utsu, 1969; Singh 
and Suarez, 1988; Yamanaka and Shimazaki, 1990; Davis 
and Frohlich, 1991; Molchan and Dmitrieva, 1992; Shaw, 
1993; Drakatos and Latoussaki, 2001]. In these studies, the 
"mainshock" was defined as the largest event of a sequence, 
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and "aftershocks" were then selected in a space-time win- 
dow around the mainshock. These studies proposed that 
the number of aftershocks increases exponentially ~ 10 am 
with the mainshock magnitude m, with an exponent a in 
the range [0.65, 1]. Other researchers [Jones et al, 1995; 
Michael and Jones, 1998; Helmstetter, 2003; Felzer et ah; 
2004] later suggested that the same mechanisms may ex- 
plain two kinds of triggering: that of aftershocks by a pre- 
vious larger earthquake and that of a large earthquake by a 
previous smaller one. Indeed, the distribution of times be- 
tween an earthquake ( "foreshock" ) and a subsequent larger 
earthquake follows the Omori law as for usual aftershocks 
[Jones and Molnar, 1979], and the magnitude of triggered 
earthquakes is independent of that for the triggering event 
[Helmstetter, 2003]. In the work of Helmstetter [2003] and 
Felzer et al. [2004], "aftershocks" were selected as earth- 
quakes occurring in a space-time window after a "main- 
shock" (any earthquake not preceded by a larger one), and 
could be larger than the mainshock. Using different meth- 
ods (see section 3.4), Helmstetter [2003] found a = 0.8 ±0.1 
and Felzer et al. [2004] found a — 1 for Southern California 
seismicity. 

Other studies measured the exponent a using a statisti- 
cal model of seismicity [Ogata, 1989; Kagan, 1991b; Ogata, 
1992; Console et al, 2003; Zhuang et al, 2004], which as- 
sumes that any earthquake can trigger other events with a 
rate which decays in time according to Omori's law and 
which increases with magnitude as ~ 10 am . Ogata has 
coined the term "ETAS" for "epidemic type aftershock se- 
quence" to describe this class of models. However, these 
models are now being used to describe the statistics of all 
earthquakes, not just aftershocks. Here we will use the des- 
ignation "ETES" for "epidemic type earthquake sequence" . 
Optimizing the likelihood of the ETES model gives an esti- 
mation of the model parameter a. When applied to long- 
time catalogs, this method usually gives a smaller value of 
a, in the range 0.5-0.7 [Console et al, 2003; Zhuang et al, 
2004], than would result by simply counting the number 
of aftershocks. When applied to individual aftershock se- 
quences [Guo and Ogata, 1997], this method gives from one 
sequence to another a large variation of a, ranging from 0.2 
to 1.9 with a mean value of 0.86. 

Yamanaka and Shimazaki [1990] explained the value a = 
1 observed for interplate earthquakes by assuming that after- 
shocks are located on patches of the mainshock fault plane. 
If the stress drop is independent of magnitude, the density 
of aftershocks on the fault plane is a constant. Therefore, 
the number of aftershocks is proportional to the area of the 
mainshock rupture plane, i.e., a = 1. Hanks [1992] used the 
same model to show that, if a is equal to the exponent b of 
the Gutenberg-Richter (GR) law, small earthquakes are just 
as important as larger ones in redistributing tectonic forces 
on faults. Helmstetter [2003] later generalized this model 
by assuming that earthquakes are located on a fractal net- 
work of dimension D. This model predicts that stress trans- 
fers between earthquakes and earthquake triggering scales 
as ~ 10 Dm / 2 . Therefore, if earthquakes are very clustered 
in space with D < 2b, small earthquakes will be more impor- 
tant than larger ones for transferring static stress between 
events and thus for earthquake triggering. 

In this work we revisit the work of Helmstetter [2003] 
by using a different method to define mainshocks and af- 
tershocks, and by discussing the model in more detail. We 
then test the model by estimating the fractal correlation di- 
mension D of earthquake hypocenters, using two relocated 
catalogs for Southern California. We compare the observed 
value of a with the prediction a — D/2 [Helmstetter, 2003]. 

2. Scaling of stress transfers and earthquake 
triggering with the mainshock magnitude 

We estimate the relative importance of small earthquakes 
for stress transfers and for earthquake triggering. Our model 
is based on the following assumptions: 



• Earthquakes are triggered by a static stress step in- 
duced by the mainshock. Triggering from dynamic stress 
changes may have a different scaling with magnitude. We 
also assume that the mainshock does not change the stress- 
ing rate, i.e., we neglect the influence of post-seismic viscous 
relaxation on the seismicity. We do not impose a relation 
between the number of aftershocks and the stress change; 
we simply assume that the rate of aftershocks at point f 
at time t after a stress change a is a function which de- 
pends only on t and a. For instance, in the rate-and-state 
model [Dieterich, 1994], the instantaneous increase in seis- 
micity rate is proportional to exp(cr), but the total number 
of aftershocks (integrating over time) is proportional to the 
stress change. 

• A triggered earthquakes size is independent of the mag- 
nitude of the triggering event ( "mainshock" ) as suggested 
by [Helmstetter, 2003]. This implies that the crust is ev- 
erywhere close to failure, such that any small earthquake, 
triggered by a previous small one, can grow into an event 
much larger than its trigger. 

• We consider earthquakes with rupture width smaller 
than the thickness of the seismogenic crust, for which the 
rupture length L is proportional to its width W . 

• The rupture length scales with the seismic moment as 
L(M) ~ M^ d . For earthquakes smaller than the thick- 
ness of the seismogenic crust and for a constant stress drop, 
d — 3. For larger earthquakes, several models have been 
proposed, with d=l,d = 2ord — 3 (see Kagan [2004b] for 
a review). 

• Earthquakes are located on a fractal network of dimen- 
sion D. The spatial clustering of seismicity is due both to 
the geometry of the fault network and to earthquake inter- 
actions. Over long time-scales, aftershocks should cover uni- 
formly the active fault network, and thus should share the 
spatial distribution of the non-correlated background seis- 
micity. In our model, we define D as the fractal dimension 
of the long-term time-independent seismicity, not including 
short-term clustering due to earthquake interactions. 

For a constant stress drop, the Coulomb stress change at 
point r due to a finite dislocation of length L and width 
W ~ L depends only on the ratio r/L and the direction $ 
(set of 3 angles) between the rupture plane and the fault 
plane at point r on which we compute the stress [Kagan, 
1991c]. This means that the average stress change at a dis- 
tance r proportional to L from an earthquake of length L 
is independent of this earthquake's magnitude. The only 
difference between small and large events is that larger ones 
increase the stress over a larger area. We can thus write 
the stress change (tensor) as a function cr(r/L,$). We first 
estimate the scaling of stress transfers between quakes as a 
function of the magnitude of the event which increases the 
stress. The expectation of the norm of the stress tensor in- 
duced by an earthquake of length L, integrating over the 
location of all earthquakes is given by 

a(L)=E(\\a(r/L,<S>)\\)= J J \\a(r/L,<i>)\\p r (r)dr(l) 
<s> o 

where $ are limits for a set of 3 angles characterizing fault 
orientation (their exact form depend on the parametriza- 
tion method) and p r (r) is the time-independent density of 
earthquakes at a distance r from another earthquake. 
For a fractal distribution, 

Pr(r) ~ r D ~\ (2) 



HELMSTETTER, KAGAN, AND JACKSON: STRESS TRANSFERS AND EARTHQUAKE TRIGGERING X - 3 



where D is the correlation dimension of earthquake hypocen- 
ters. The function p r (r) in (1) describes the long-term spa- 
tial distribution of seismicity, not including clustering due 
to aftershocks. Introducing (2) in (1) and using the new 
variable x — r/L, we have 



a{L) 



d$ / a{x,<t)p r (x)dx . 



(3) 



The integral in (3) is now independent of the events length; 
the only dependence on magnitude is in the factor LP . The 
integral is convergent both in the near field (the stress has a 
r -1 / 2 singularity close to the crack tip which is integrable) 
and in the far field if D < 3 because a(x) decays as 1/x 3 for 



x ^> 1. Assuming that L ~ M, 



1/3 



10" 



(for a constant 



stress drop and for L proportional to W), we can rewrite the 
average stress o(Mo) induced by an earthquake of moment 
Mo at the location of another event (distributed according 
to p r (r)) as [Kagan, 1994] 



<t(M ) ~ M, 



D/3 



(4) 



If earthquakes are distributed uniformly in a volume (D = 
3), or if we compute the integral of the stress change induced 
by a quake on a volume rather than at the location of other 
events, then a(Mo) ~ Mo- In other words, stress change 
integrated on a volume is proportional to seismic moment, 
and is therefore dominated by the largest earthquakes in a 
catalog if the exponent (3 of the cumulative moment distri- 
bution is P < 1 (i.e., b < 1.5 for magnitudes). In contrast, 
if we compute the stress changes at the location of other 
quakes, and if D w 2, then the increase in the stress change 
with Mo is balanced by the decreased number of events pro- 
portional to Mq with seismic moment (equivalent to GR 
law with 6=1). In this case D — 3/3, small earthquakes are 
just as important as larger ones for stress transfers between 
earthquakes. 

If earthquake triggering is due to static stress changes, 
the seismicity rate N(r, t) at point (r, $) and time t can 
be written as a function f(o, i) of the stress change a and 
of the time t after the stress change. The function f(a, t) 
describes any physical mechanism of earthquake triggering 
due to static stress changes, such as rate-and state friction 
[Dietemch, 1994], sub-critical crack growth [Das and Scholz, 



Mikumo and Miy- 
or pore fluid flow 



1981; Shaw, 1993], viscous relaxation 
atake, 1979], static fatigue [Scholz, 1968 
[Nur and Booker, 1972]. The cumulative number of earth- 
quakes Na.it (L) triggered by an earthquake of length L in 
the time window [c, T] is given by the integral over time 
and space of the seismicity rate 



iVaft(i) = J dtj d$J f(a,t)p r (r)dr 

c * 

T oo 

= L D J dt J d$ J f(a,t)p r (x)dx . 

c # 



(5) 



The minimum time c is introduced to avoid the singular- 
ity of the seismicity rate at time t = predicted by some 
models of stress triggering (e.g., in the case of the rate-and- 
state model [Dieterich, 1994] when a singularity occurs in 
the stress field). The maximum time T is needed to regular- 
ize the integral at large times for some models which predict 
an Omori law decay with p < 1 (but we can take T — oo in 
Dieterictis [1994] model). As for the stress change a(L) in 
(3), the number of triggered earthquakes depends on mag- 
nitude only in the factor L D . The scaling of aftershock pro- 
ductivity with the size of the triggering earthquake is the 



same as for stress transfers. For seismic moment 

AU(Mo) ~M D/ \ (6) 

and for magnitude 



iV aft (m) ~ 10 am with a = 0.5D . 



(7) 



The result (7) does not depend on the physical mechanism 
of earthquake triggering, i.e., on the specific form of the 
function f(a,t) in (5), but is valid for any model in which 
aftershock properties (seismicity rate, location, duration of 
the aftershock sequence) are functions of the static stress 
change induced by the mainshock, and on the time since 
the stress change, and have no other dependance on the 
mainshock magnitude. In the next section, we measure the 
scaling of the aftershock rate with the mainshock magnitude 
for Southern California seismicity. We then estimate the 
fractal dimension D of the spatial distribution of hypocen- 
ters in section 5 and compare the results with the model's 
prediction (7). 

3. Scaling of earthquake triggering with 
mainshock magnitude: observations for 
California seismicity 

We have measured the average rate of triggered earth- 
quakes ( "aftershocks" ) following a "mainshock" in Southern 
California to measure the scaling of the number of triggered 
earthquakes with the triggering magnitude. We first dis- 
cuss some potential problems in analyzing aftershock prop- 
erties. We describe the catalog in section 3.2. We then de- 
fine "mainshocks" and "aftershocks" and explain our declus- 
tering procedure. We present our results on the scaling of 
earthquake triggering with the magnitude of the triggering 
event in section 3.4, and findings on the magnitude distri- 
bution of triggered earthquakes in section 3.5 

3.1. Problems with estimating aftershock properties 

Several properties of aftershocks are difficult, if not im- 
possible, to estimate and may be very sensitive to the pa- 
rameters of the declustering procedure. It is particularly 
difficult to define the total number of aftershocks. If earth- 
quake triggering follows Omori's law ~ l/t p , with p > 1 and 
without cut-off at short times, there is an infinite number 
of aftershocks shortly after the mainshock when the seismic 
network is saturated. These aftershocks are not reported 
in the catalogs. For p < 1, there are also infinitely many 
aftershocks at very long times (between an arbitrary time T 
following the mainshock up to t = oo) which are mixed up 
with non-correlated, background quakes. Therefore, we miss 
most aftershocks if we select earthquakes in seismicity cat- 
alogs in a finite time window after the mainshock. Because 
it is impossible to measure the total number of triggered 
events, an alternative solution is to measure the rate of af- 
tershocks in a finite time window after the mainshock, when 
the catalog is complete and before the rate goes back to the 
constant background level. If the temporal variation in the 
number of triggered events is the same for all mainshock 
magnitudes (as expected if the rate of aftershocks depends 
only on the stress and the time since the stress step), then 
the scaling of the total number of aftershocks (exponent a 
in (7)) can be measured from that of the rate of triggered 
events with the mainshock magnitude. 

It is also impossible to distinguish between the "direct" 
(triggered by the "mainshock" only) and "secondary" after- 
shocks (triggered by a previous aftershock from the main- 
shock). Because an earthquake is probably not triggered 



X - 4 HELMSTETTER, KAGAN, AND JACKSON: STRESS TRANSFERS AND EARTHQUAKE TRIGGERING 



by a single prior quake, but rather by the cumulative ef- 
fect of all previous events (with a weight depending on the 
time, distance and magnitude of this particular quake), it is 
impossible to tell which earthquake was triggered by which 
other one. Only a stochastic answer can be obtained, as- 
suming a particular model of earthquake interactions: for 
example, that earthquakes obey the Epidemic Type Earth- 
quake Sequences (ETES or ETAS) model [ Zhuang et al., 
2004]. After estimating the parameters by maximizing the 
likelihood of the model, Kagan and Knopoff [1976] and later 
Zhuang et al. [2004] have proposed a method to calculate 
the probability that an earthquake was triggered by a pre- 
vious one, or that it was a background event. However, if 
the number of secondary aftershocks is proportional to the 
number of direct aftershocks, then we can measure the total 
(observed) rate of triggered seismicity, including direct and 
secondary aftershocks, which will have the same dependence 
on the mainshock magnitude as does the number of direct 
aftershocks [Helrnstetter and Sornette, 2003]. 

Rather than by fitting a multi-parameter stochastic clus- 
tering model, such as ETES model, we have estimated earth- 
quake clustering properties here directly from the earth- 
quake catalog, with some simple selection criteria. Although 
the ETES type models usually describe earthquake cluster- 
ing well, using them to determine parameters like a can 
be problematic. First, the estimates so determined are not 
very robust, because the model parameters are strongly cor- 
related and poorly resolved. Second, the parameter esti- 
mates may be biased. Inevitable errors in magnitude, loca- 
tion, and temporary incompleteness after larger events [Ka- 
gan, 2004a] may each bias earthquake statistics by under- 
estimating clustering at short distances and times. Kagan 
[1991b, see p. 142 and his Eq. 36] argues that the value of 
a obtained by maximum likelihood inversion of ETES pa- 
rameters depends on the method used to account for the 
incompleteness of short-term aftershocks in a catalog. Fur- 
thermore, the ETES models generally assume isotropic clus- 
tering, whereas earthquakes occur preferentially on spatially 
clustered faults. Neglecting this spatial clustering can lead 
to underestimating a, because the contribution to the to- 
tal predicted seismicity rate from all small aftershocks bet- 
ter predicts the location of future aftershocks than does 
the isotropic contribution from the mainshock. A smaller 
a gives more weight to smaller earthquakes and therefore 
better accounts for the following factors: incompleteness of 
the catalog after a large earthquake, the heterogeneity of 
the spatial distribution of aftershocks, fluctuations of after- 
shock productivity, and magnitude errors (see Helrnstetter 
et al. [2004] for more details). 

3.2. Data 

We used the catalog of seismicity for California pro- 
vided by the Advanced National Seismic System (ANSS) 
(available at http://quake.geo.berkeley.edu/anss/catalog- 
search.html). This catalog merges data from several net- 
works (for this work, essentially Southern and Northern 
California and Nevada). We used earthquakes larger than 
a magnitude threshold mj = 2, within the time win- 
dow 1980/1/1-2004/10/15, and within the rectangular area 
32° < latitude < 37.5° and -122° < longitude < -114°. 
This zone does not include Long Valley volcanic seismicity. 
We checked that the catalog is complete for m > 2 in this 
area and time period (except at short times after a large 
earthquake). We removed explosions from the catalog. 

When available, for large earthquakes, we used moment 
magnitudes provided by the Harvard catalog (available at 
http:/ /www.seismology.harvard.edu/CMTsearch.html, see 
Ekstrom et al. [2003] and references therein). Other- 
wise we used the magnitudes provided by the ANSS 



catalog (see http://quake.geo.berkeley.edu/ftp/pub/doc/- 
cat5/cnss.catalog.5 for more details). The magnitudes for 
smaller earthquakes are generally uil, or local magnitudes 
estimated from short-period body waves. Our use of magni- 
tudes other than moment magnitude is a potential problem, 
because the scaling relationships and simple static stress 
models we employ are based on the seismic moment, which 
we assume is simply related to magnitude. Hutton and Jones 
[1993] show uil is a fairly good unbiased surrogate for mo- 
ment magnitude in the magnitude range 4.5 to 6, but there 
may be a significant bias for quakes smaller than 4.5. At this 
point we have to assume that any bias does not affect the 
results much, but in the future this assumption needs to be 
tested using moment magnitudes for smaller earthquakes. 

3.3. Selection of mainshocks and aftershocks 

The objective of the declustering procedure is to select 
as mainshocks only events not preceded by any large earth- 
quake; then quakes following mainshocks within a specified 
time and distance are identified as aftershocks. The space 
and time windows are chosen in order to minimize the influ- 
ence of non-correlated earthquakes. There are always arbi- 
trary choices in a declustering procedure, so we tested dif- 
ferent values for all adjustable parameters. As a test wc 
also applied this declustering method to synthetic catalogs 
to estimate the accuracy and lack of bias in results. 

Because we want to investigate the temporal decay of trig- 
gered seismicity and the scaling of aftershock productivity 
with the mainshock magnitude, we want a declustering pro- 
cedure which makes minimal assumptions about the prop- 
erties of aftershock triggering in time, space and magnitude, 
without introducing a-priori any scaling of the number of af- 
tershocks or their duration with the mainshock magnitude. 
We assume that larger earthquakes influence seismicity in 
a wider area, proportional to the mainshock rupture area 
[Kagan, 2002a]. 

Practically, we select as mainshocks any earthquake of 
magnitude tum not in the influence zone Rf x Tf of a 
previous event of magnitude m > tum — Am. We use 
generally Am = 1. This choice ensures that aftershocks 
of the foreshock are negligible compared to aftershocks of 
the "mainshock" because aftershock productivity increases 
rapidly with the mainshock magnitude. We compute dis- 
tances between earthquake epicenters, because depth has a 
larger error than horizontal coordinates. The spatial influ- 
ence zone i?p of a foreshock is defined as 

7?i?(m) = max(Z?F, ^Vl,f x L(m)) , (8) 

where the constant Df accounts for a location's accuracy, 
L(m) is the rupture length [ WGCEP, 2003] of an earthquake 
of magnitude m 

L(m) = 0.01 x 10°' 5m (km) , (9) 

and A r L,F is an adjustable factor. Mainshocks are selected 
with a rectangular area 32.1° < latitude < 37.4° and 
-121.9° < longitude < -114.1° slightly smaller (by 0.1°) 
than the box used to select foreshocks and aftershocks so as 
to avoid finite size effects. 

We then select as aftershocks of a mainshock all quakes 
of magnitude m > mj in the influence zone Ra x Ta of the 
mainshock, or of a previous aftershock of the mainshock, 
even if they are larger than the mainshock. The size of a 
cluster can thus increase with time, due to aftershock diffu- 
sion or due to secondary aftershocks, as in Reasenberg [1985]. 
We use a space window 

R A (m) = nrnx{D A ,N L , A x L{m)) . (10) 

We use parameters Da and At, a in (10) generally smaller 
than the parameters Df and Al.f used to define main- 
shocks, because we do not want to consider as a mainshock 
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any earthquake influenced by a previous one, and we do not 
want to include as aftershocks distant quakes mixed with 
the background. Thus, our declustering method has 7 ad- 
justable parameters: Da, Dp, Nl ; a, Nl,f, Ta, Tf and Am. 
With our algorithm, a small fraction of earthquakes are con- 
sidered as aftershocks of several mainshocks. For instance, 
Landers aftershocks are also considered as aftershocks of a 
m = 2.8 earthquake which occurred 6 hours before Landers. 

Our method is very similar to the one of Reasenberg 
[1985], except that we do not require that aftershocks be 
smaller than the mainshock, and we do not include a priori 
hypotheses on either the scaling of aftershock productiv- 
ity with the mainshock or on the temporal decrease of the 
seismicity rate with time. By contrast, Reasenberg [1985] 
assumes that aftershock productivity increases as 10 2m / 3 
and that the rate of aftershocks follows the Omori law with 
p=l. 

The main differences between this work and the previous 
analysis of [Helmstetter, 2003] are that Helmstetter [2003] 
used a constant value for Rf = 50 km (independent of the 
foreshock magnitude), Helmstetter [2003] also used Am = 0, 
and selected as aftershocks earthquakes in the influence zone 
of the mainshock only (not in the influence zone of a previous 
aftershock). We also use here a different catalog from [Helm- 
stetter, 2003] (with different minimum magnitude, space and 
time windows). 

Our definition of triggered events is also very similar to 
that of Felzer et al. [2004], but the parameters (minimum 
magnitude, time interval) and the method used to com- 
pute a differ. The only difference in the algorithm of af- 
tershock selection is that Felzer et al. [2004] selected after- 
shocks within the influence zone of either the mainshock or 
of its largest aftershock, while we select aftershocks in the 
influence zone of both the mainshock and all its aftershocks. 
This difference should have only a minor effect, as aftershock 
diffusion is very weak if there is any at all [Helmstetter et 
al, 2003]. Secondly, if a mainshock triggers a larger event, 
most of the following events will occur in the influence zone 
of the largest earthquake, which generally includes the in- 
fluence zone of the previous "mainshock" . 

3.4. Scaling of earthquake triggering with the 
magnitude of the triggering events 

We stack all aftershock sequences which have the same 
mainshock magnitude rriM , for each class of mainshock mag- 
nitude ranging from 2 to 7 with a bin size of 0.5. We use 
a kernel method [Izenman, 1991] to estimate the seismicity 
rate, by convolving the logarithm of earthquake times with a 
gaussian kernel of width h = 0.1 for rrid = 2 and h = 0.2 for 
mj = 3. The results for Df = 3 km, Da = 2 km, Nl,a = 2, 
Nl,f = 3, Tf = 1 yr, Am = 1 and mj = 3 are shown in Fig- 
ure 1. In choosing this parameter, we have checked that our 
"mainshocks" are almost uniformly distributed in time, and 
are therefore not strongly influenced by other earthquakes. 

For each mainshock magnitude, the aftershock rate de- 
cays with the time since the triggering event approximately 
as 

\{t,m M )= tp , (11) 

with p ~ 0.9. This expression (11) is only correct in a finite 
time window [i m m(m,M), tmnx(m M )], for which the catalog 
is complete above rrid and the proportion of non-correlated 
events is negligible. The Omori law has been reported in 
many observations of aftershock sequences (see Utsu et al. 
[1995] for a review), and can be derived from several phys- 
ical models of earthquake triggering. For example, Omori 
decay (11) with an exponent p « 0.9 can be reproduced by 
the rate-and-state model of Dieterich [1994] using a non- 
uniform stress distribution. Due to our rule of aftershocks 
and mainshocks selection, the seismicity rate after the main- 
shock is much larger than before. We conclude that, when 



Omori law decay with p « 0.9 is observed, the earthquakes 
which we define as "aftershocks" are probably causally re- 
lated to (triggered by) the "mainshock" rather than simply 
correlated to it. 

Based on analyzing the magnitude distribution for several 
aftershock sequences in Southern California, Helmstetter et 
al. [2004] have proposed a relation between the magnitude 
of completeness m c (t,mM) as a function of the mainshock 
magnitude and of the time (in days) since the mainshock 

m c (t,m M ) = m M — 4.5 — 0.761og 10 (t) 

and m c (t,rnM) > 2. (12) 

Earthquakes smaller than this threshold are generally not 
detected due to overlapping of seismic records and satura- 
tion of the network. Of course some fluctuations of m c occur 
from one sequence to another, but this relation gives a good 
fit to all sequences we analyzed within 0.3 magnitude units. 
For Landers m = 7.3, expression (12) predicts that the com- 
pleteness magnitude recovers its usual value rrid — 2 about 
10 days after the mainshock. 

For each value of mm, we fit the rate of aftershocks 
by (11) with p — 0.9, in the time interval [i m in(m,M), 
(maxfniiif)], to estimate K(niM). The minimum time 
Wn (rriM, rrid) is the time after which the catalog is complete 
for m > rrid, estimated using (12). Parameter t max (miVf) is 
either fixed to 10 days or given by the condition \{t,rriM) > 
A m in, where A m i n = 0.01 day -1 is the seismicity rate be- 
low which that rate does not obey Omori's law (11). At 
large times, the seismicity rate goes to a constant level, due 
to the background rate or the influence of non-correlated 
aftershock sequences (see Figure 1). For small vtiM, the 
seismicity rate increases at large times because the cluster 
size increases with time, as we enlarge the cluster with new 
earthquakes related to previous ones. The peak at t ~ 7 
years for 6 < tum < 6.5 and 7 < tum < 7.5 is due to the 
M = 7.1 1999 Hector-Mine aftershock sequence, which is 
included in the clusters of the 1992 Landers M — 7.3 and 
Joshua- Tree M = 6.1 earthquakes. 

Figure lb shows that aftershock productivity increases 
exponentially with mainshock magnitude. The value of 
K(mu) (representing the seismicity rate per day for m > 
m d at time t — 1 day after a mainshock of magnitude mm) 
is given by 

K{m M ) = A'ol0" mM Kn f,mt \ (13) 

with Ko — 0.0033 day p ~ and a = 1.07. We recover almost 
the same a value as Felzer et al. [2004], who uses a similar 
method of aftershock selection with parameters Am = 0, 
T F = 30 days, T A = 2 days, Nf = Na = 2, Df = Da = 5 
km and two values of rrid = 3.5 and rrid = 4.5. They mea- 
sured a by estimating the scaling of the total number of 
aftershocks in a 2-day period with the mainshock magni- 
tude. This value of a is close to the GR 6-value (see Figure 
lb). The b- value measured by mean-square linear regres- 
sion of the cumulative distribution for m > 2 in the time 
window 1980-2004 gives b = 0.94. A maximum likelihood 
method gives b = 1.03. This value of b « a implies that all 
earthquakes in a given magnitude range collectively attain 
the same importance for earthquake triggering, as suggested 
previously by Agnew and Jones [1991], Michael and Jones 
[1998] and Felzer et al. [2002, 2004]. For a = b, the in- 
crease of aftershock productivity with mm compensates for 
the decreased number of earthquakes with magnitude. Even 
if an earthquake of magnitude m — 7 triggers on average 10 5 
times more aftershocks than a magnitude 2, an earthquake of 
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any magnitude is as likely to be triggered by a m = 7 earth- 
quake as by a m = 2, because there are 10 5 more m — 2 
than m — 7 earthquakes in the catalog. 

The roll-off of the seismicity rate at times t < i m in(niM) 
can be explained by catalog incompleteness at shorter times 
[Kagan, 2004a]. To check this, we have corrected the seis- 
micity rate using GR law with b = 1 to estimate the rate 
of seismicity for m > ma from the observed rate of events 
for m > m c (mM,t). We have removed from the catalog all 
events with m < m c (mM,t), measured the rate of seismicity 
A m > mc (i, m M ) as a function of time and mainshock magni- 
tude, and then estimated the rate of m > m d earthquakes 
by 



(t,m M ) = A m > mc (t,m M )10 mc(t ' mM) - md . (14) 



The results are shown in Fig. 2: the rate of seismicity now 
follows the Omori law for t > 2 x 10 days for all main- 
shock magnitudes. The cut-off at t ^ 10~ 3 days (86 sec) 
is probably due to the fact that earthquakes at such small 
times cannot be distinguished from the mainshock and are 
therefore not reported in the catalog. 

We have tested different values of the parameters of af- 
tershock and mainshock selection (see Table 1). The fitting 
interval (i max and A m i n ) was adjusted so that the seismic- 
ity rate for t < t max and \(t) > A min decays according to 
Omori's law with p ^ 0.9, as expected for triggered earth- 
quakes. All tests, for reasonable values of Da smaller than 
the location accuracy (2 km for A-quality locations), give 
a = 1.05 ±0.05. There are however large variations of Kq as 
a function of the parameters of aftershock and mainshock se- 
lection. The value of a exceeds the value a — 0.8 ±0.1 found 
by Helmstetter [2003] . This discrepancy is due to differences 
in the selection of aftershocks and mainshocks. Helmstetter 
[2003] used a constant value of Rf = 50 km, independent 
of the "foreshock" magnitude. This value Rf = 50 is too 
small to exclude distant Landers aftershocks. Helmstetter 
[2003] also used a larger value of i max = 1 yr, independent 
of mM, so that a significant fraction of background events 
may have contaminated aftershocks of small mainshocks. 

We have applied the same method (selection of main- 
shocks and aftershocks, correction for undetected early af- 
tershocks, and fit by Omori's law) to synthetic catalogs gen- 
erated with the ETES model. In this model, the seismicity 
rate X(t, r, m) is the sum of a constant background term /j, 
and of aftershocks from all previous earthquakes: 



(m— m ) 



\{t,r,m) = p m (m) + ^ fclO" 

ti<t 

6c e vL{mi) v 



(t + C y+ e (|>r-r l |±L(m i )) 1 + 1 ' 



(15) 



where L(m) = 0.01 x I0°' 5m is the rupture length, p m (m) 
is the magnitude distribution, c, 9, k, a, v are adjustable 
parameters. Our method recovers the a value of the model 
with an accuracy ~ 0.03, but underestimates the total num- 
ber of aftershocks, because distant aftershocks outside the 
"influence zone" used for aftershock selection are missed (see 
results in Table 1). 

In order to study the rate of aftershocks at large times 
after a mainshock, we have selected aftershocks very close 
to the mainshock, in order to minimize the influence of non- 
correlated earthquake. We used parameters Da = 0.5 km, 
Nl,f = 2, Df — 3 km, and Nl,f = 3, and we did not 
include aftershocks in the influence zone of previous after- 
shocks (see model #8 in Table 1). We thus under-estimate 
the number of aftershocks for small mainshocks, because the 
influence zone of small events is smaller than the location 
accuracy, and because we miss more secondary distant af- 
tershocks for small mainshocks. The results are shown in 
Figure 3. We observe that the duration of the aftershock 



sequence is at least 1000 days, independent of the main- 
shock magnitude, as predicted by the rate-and-state model 
of seismicity [Dieterich, 1994]. This figure also confirms that 
Omori's exponent does not depend on the mainshock mag- 
nitude. 

The instantaneous increase of seismicity rate after a main- 
shock, compared to the long-term rate, is at least of a factor 
10 5 , maybe larger if we could detect aftershocks at shorter 
times. Using the rate-and-state model [Dieterich, 1994], this 
means that the ratio Aa/Aa„ (where Act is the maximum 
Coulomb stress change, A is a parameter of the rate-and- 
state friction law, and a„ is the normal stress) is at least 
equal to log(10 5 ) = 11.5. Assuming Aa n = 0.4 bar [Toda 
and Stein, 2003], this gives a maximum stress of 4.6 bar. 

3.5. Magnitude distribution of triggered earthquakes 

We have analyzed the magnitude distribution of triggered 
earthquakes for each class of the mainshock magnitude be- 
tween 2 and 7. As above, we select aftershocks in a time 
window [tmin(w-M), tmax(niM)] such that the catalog is com- 
plete above mj = 2 and that the influence of non-correlated 
earthquakes is negligible (A(t, tum) > 0.1 day -1 ). The re- 
sults are shown in Figure 4. We observe that the magni- 
tude distribution of triggered earthquakes follows the GR 
law with b « 1, independent of the mainshock magnitude. 
For 7 < m M < 7.5 (Hector-Mine and Landers), the 6- value 
seems larger than 1. However, if we use m d — 3 and a 
smaller minimum time i m in, the magnitude distribution is 
close to the GR law with 6 = 1. These results imply that 
a small earthquake can trigger a much larger earthquake. 
It thus validates our hypothesis that the size of a triggered 
earthquake is not determined by the size of the trigger, but 
that any small earthquake can grow into a much larger one 
[Kagan, 1991b; Helmstetter, 2003; Felzer et at, 2004]. The 
magnitude of the triggering earthquake controls only the 
number of triggered quakes. 

3.6. Proportion of triggered events in catalogs 

We use the scaling of the average number of triggered 
earthquakes per mainshock and the hypothesis that the 
magnitude distribution of all quakes follows the GR law 
to derive the long-term fraction of aftershocks (assuming 
boundaries in time and space for the definition of after- 
shocks). The rate of m > md aftershocks triggered, directly 
or indirectly, by an earthquake of magnitude m at time t 
after this quake follows approximately 



\m(t) = 



tp 



(16) 



with p = 0.9 and K = 0.0041 day p_1 . This relation holds 
at least for times t > 0.001 day (86 sec), for all mainshock 
magnitudes, after correcting for the incompleteness of the 
catalog at short times using eq. (12). For m > 7, the 
Omori law decay holds at least up to T ~ 100 days. If 
we assume that this "aftershock duration" does not depend 
on the mainshock magnitude, the average total number of 
m > 2 aftershocks (including secondary aftershocks) trig- 
gered by m > md mainshocks in the time window c < t < T 
is 



AMm) = J \ m (t)dt = K W am - bm « — P ] —y P .(17) 



Averaging over all magnitudes of the mainshock we can com- 
pute the average total number iV a f t of m > m d aftershocks 
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(including secondary aftershocks) per mainshock of magni- 
tude m d < m < m max . Assuming a GR law with b = 1 and 
with an upper magnitude cut-off at m = m m ax, AT a ft is given 
by 



m max 

JVaft = J p(m)N a ,f t (m)dm 



(18) 



rpl-p _ 1-p f** 

= Ko 1 / b 

1-p J 



ln(10)10- 6m 10 am dm 



K bW {a - b)md 



[1 - 10' 



(a-6)(m max -m d )" 



b — a 

The special case a — b gives 



t i_p_ c i_p 



AT aft = if ( ,fcln(10)(m max - m d ) T ^— ^ . (19) 

Using the parameters a = 1.05, b = 1., c = 0.001 day, 
Ko = 0.0041 day p_ , and mj = 2 estimated for Southern 
California (see Table 1 line 2) and assuming a maximum 
magnitude m max = 8 and an aftershock duration T = 100 
days, we obtain AT a ft = 1.11. In other words, a fraction 
•Waft /(-Waft + 1) = 53% of m > 2 earthquakes in Califor- 
nia are aftershocks of a m > 2 earthquake. If we assume 
that Omori's law with p — 0.9 holds up to T = 10000 days 
(27 yrs), then iV a f t = 2.06 (67% of earthquakes are after- 
shocks). The results do not depend on c if p < 1. The 
fraction of aftershocks increases if we include mainshocks 
with m < 2. Assuming that equation (17) and the GR law 
still hold down to m d = —1 [Abercrombie, 1995], we obtain 
iV a ft = 2 for T = 100 days: 67% of earthquakes of any mag- 
nitude are triggered by earthquakes of magnitude m > — 1. 
The fraction of triggered events goes to 1 as the minimum 
magnitude m d goes to — oo [Sornette and Werner, 2004]. 

Our results are relatively close to the generic model of 
Californian aftershocks Reasenberg and Jones [1989, 1994] 
(RJ), which assumed 



Xm(t) 



(t + c)p 



(20) 



with c = 0.05 days, p = 1.08, b = 0.91, a = -1.67 [Reasen- 
berg and Jones, 1994]. Their model gives K = 10 a = 0.017 
day -1 for m d — 2, but with smaller a and b values (a is 
assumed to be equal to b), so that the rate of m > 2 after- 
shocks at t = 1 day and for rn — 7 is A m (t) = 758 day -1 for 
RJ model and A m (t) = 729 day -1 for our model (line #2 in 
Table 1). 

4. Scaling of stress transfers with the 
mainshock magnitude 

Computing stress changes induced by a small earthquake 
at the location of another quake is very difficult. The stress 
field created by an earthquake is sensitive to the slip dis- 
tribution on the fault, at least for small distances from the 
fault plane [Steacy et at, 2004]. Slip distribution is usually 
available only for large m > 6 earthquakes in California. 
For small earthquakes, we can use a point-source model if 
we know the focal mechanism, to calculate stress for dis- 
tances from the fault much larger than the rupture dimen- 
sion. We can also estimate the most likely focal plane from 
the fault orientations in the area or from the orientation of 
the tectonic stress. Then we can model the rupture by a 
rectangular dislocation with a uniform or tapered slip. But 
this simple source model would still be incorrect close to the 
fault, where most aftershocks are located. 



Marsan [2004] solved this problem by computing the dis- 
tribution of the stress induced by an earthquake at the loca- 
tion of another earthquake, and by studying only the tail of 
the distribution (small absolute value of stress) correspond- 
ing to distances larger than the rupture length of the event. 
Marsan [2004] concluded that small earthquakes are at least 
as important as larger ones for redistributing stress between 
events. This result is in agreement with our result for scal- 
ing the number of triggered earthquakes with the mainshock 
magnitude. But even in the far field, these calculations of 
the Coulomb stress change induced by an earthquake are 
very inaccurate, because the accuracy of focal mechanisms 
is on the order of 30° [Kagan, 2002b]. Another problem in 
[Marsan, 2004] is that he considers earthquakes of magni- 
tude m = 3, which have a rupture length L « 300 m, much 
smaller than the average accuracy on vertical coordinates of 
4 km. These location errors can significantly bias any esti- 
mate of the stress change induced by small m < 5 quakes 
[Hue and Main, 2003] . 

The observation that earthquake triggering scales with 
the magnitude m of the trigger as ~ 10 Qm with a w 6 sug- 
gests that small events should not be neglected in studying 
stress interactions between quakes. However, directly calcu- 
lating the stress induced by a small events is impossible due 
to the inaccuracy of earthquake locations and focal mecha- 
nisms. Even if we cannot compute the spatial distribution 
of the stress induced by a small earthquake, we can esti- 
mate how such small events trigger others. On average the 
total number of events triggered by an earthquake (inte- 
grated over space) will be independent of the earthquake 
source. For instance, we can account for the influence of 
small earthquakes in the rate-and-state model of Dieterich 
[1994] by using for such small events a point source model 
which, as we know, depends only on the focal mechanism. 
We can then compute the seismicity rate on each point of 
a grid by integrating on each cell that rate estimated us- 
ing equation (12) of [Dieterich, 1994] (using space-variable 
stress for a point-source model). If the size of each cell is 
equal to a few km, a point source model approximates m < 4 
earthquakes well, because the integral of the seismicity rate 
over each cell depends only slightly on the geometry of the 
rupture fault and the variations of fault slip. 

Another alternative is to use empirical laws like the ETES 
model [Kagan and Knopoff, 1987b; Felzer et al, 2003a; 
Helmstetter et al., 2004], rather than stress calculations and 
physical models of stress interactions. We can estimate the 
spatial distribution of future aftershocks for a large earth- 
quake by smoothing the locations of early aftershocks. A 
magnitude m — 7 quake has enough aftershocks in the 
first hour to estimate the distribution of future aftershocks 
[Helmstetter et al, 2004]. In contrast, predictions based on 
Coulomb stress change calculations require a good model of 
slip distribution on the fault, which is not available until a 
few hours after the mainshock at best. 



5. Spatial distribution of seismicity 

We have estimated the distribution of distances be- 
tween hypocenters p r (r), using the relocated catalogs for 
Southern California by Hauksson et al. [2003] (HCS) and 
Shearer et al. [2003] (SHLK). These catalogs apply waveform 
cross-correlation to obtain precise differential times between 
nearby events. These times can then be used to greatly im- 
prove the relative location accuracy within clusters of sim- 
ilar events. Locations in these two catalogs do not always 
agree in detail, reflecting their different modeling assump- 
tions and seismic velocity structures. However, their overall 
agreement is quite good, particularly when compared to the 
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standard catalog locations. In many regions, these new cat- 
alogs resolve individual faults in what previously appeared 
to be diffuse earthquake clouds. In both catalogs, we have 
selected only m > 2 earthquakes relocated with an accuracy 
of e h and e z smaller than 100m. In the HCS catalog, there 
are 71943 m > 2 earthquakes between 1984 and 2002, out 
of which 24127 (34%) are relocated with e h < 0.1 km and 
e z < 0.1 km. In the SHLK catalog, there are 82442 m > 2 
earthquakes in the same period [1984, 2002] , out of which 
33676 (41%) are relocated with e h < 0.1 km and e z < 0.1 
km. The probability density function of distances p r (r) be- 
tween hypocenters is close to a power-law p r (r ~ r D_1 in 
the range 0.1 < r < 5 km. The correlation fractal dimen- 
sion (measured by least-square linear regression of log(r) 
and log(p r (r)) for 0.1 < r < 5 km) is D = 1.54 for SHLK 
and D = 1.73 for HCS (see black lines in Figure 5). The 
faster decay for r < 0.1 km is due to location errors, and 
the roll-off for distances r > 5 km is due to the finite thick- 
ness of the seismogenic crust. The difference in the D- value 
between the two catalogs may come from larger location er- 
rors in HCS, but also from errors in estimating the D- value 
(see Kagan [2004c] for a discussion of errors and biases in 
determining the fractal correlation dimension). 

To check the accuracy of the D-value, we tested a syn- 
thetic catalog, generated with the same number of events 
as the real catalog. We used random longitudes and lat- 
itudes, with a uniform epicentral distribution within the 
same boundaries as real data. Depths in the synthetic 
catalog were chosen by shuffling the depths of earthquakes 
from the real catalog to keep the same depth distribution. 
The fractal dimension in the range 0.1 < r < 5 km was 
D = 2.93 ± 0.04: a little smaller than the value D = 3 ex- 
pected for a purely uniform distribution in a volume. This 
discrepancy is due to three factors: the finite number of 
events, the non-uniform distribution of depths, and finite 
size effects. 

The spatial clustering of earthquake hypocenters mea- 
sured for the full catalog is due both to the fault network 
structure and to earthquake interactions. The latter mod- 
ify long-term spatial distribution by increasing the fraction 
of small inter-event distances, that is decreasing the frac- 
tal dimension [Kagan, 1991a]. The clustering of aftershocks 
around mainshocks, and of secondary aftershocks close to 
the direct aftershock of the mainshocks, and so on, can cre- 
ate a fractal distribution. This results from the scaling of the 
aftershock zone with magnitude coupled with the GR law, 
without any underlying fractal fault structure [Helmstetter 
and Sornette, 2002]. 

Over very long time intervals, triggered earthquakes 
should be located uniformly on the fault network, and the 
fractal dimension of the whole catalog should reach a con- 
stant value. In our model described in equation (5), p r (r) 
represents the "long-term" time-independent spatial dis- 
tribution of seismicity. Distribution p r {r) gives the aver- 
age density of earthquakes at a distance r from the main- 
shock due to fault structure geometry and the faults time- 
independent heterogeneity. This does not take into account 
earthquake interactions. The effect of the mainshock, which 
increases or decreases the seismicity rate at close distances 
by modifying the stress, is described in the factor /(<r, t) 
in (5). Therefore, we should not use in (5) the function 
p r (r) estimated from the catalogs of Southern California 
seismicity because these catalogs cover less than 20 years 
and contain a majority of aftershocks. If we had included 
that function, we would be double-counting earthquake in- 
teractions. 

To estimate and remove the time dependence of the spa- 
tial distribution of inter-event distances, we have measured 
that distribution p r (r,t) using only earthquake pairs with 
an inter-event time r in the range [t, t + dt\. The results 
are shown in Figure 5. As the minimum inter-event time 



increases, the fraction of small distances will decrease. The 
fractal dimension of p r (r,t) increases between D ~ at 
times t = 5 minutes up to D w 2 for t — 2500 days (see 
Figure 6). This maximum inter-event time of 2500 days is 
long enough so that earthquake interactions are negligible 
compared to tectonic loading, and a very small fraction of 
earthquake pairs with r > 1000 days belong to the same af- 
tershock sequence. This value D = 2, measured for t = 2500 
days, can thus be interpreted as the fractal dimension of the 
active fault network. 

6. Conclusion 

Although large earthquakes are much more important 
than smaller ones for energy release, small quakes have col- 
lectively the same influence as large ones for stress changes 
between earthquakes, due to seismic spatial clustering. Be- 
cause the stress drop is constant, the stress change induced 
by an earthquake of magnitude m at the location of other 
earthquakes (located on a fractal network of dimension D) 
increases with mas~ 

1Q Dm/2 Measuring directly the scal- 
ing of such stress change with magnitude is difficult, because 
the accuracy of earthquake locations and focal mechanisms 
is limited. We can, however, estimate indirectly the rela- 
tive importance of small and large events for stress transfer 
by studying the properties of triggered seismicity. If quakes 
are triggered by static stress changes, then the number of 
such triggered events should also scale as ~ \Q Dm / 2 with 
the magnitude of the trigger. 

We have measured the average seismic rate triggered by 
an earthquake for Southern California seismicity. We found 
that the rate of triggered events decays with time accord- 
ing to Omori's law ~ l/(t + c) p with p = 0.9 and c < 3 
minutes (after correcting for the increase in the complete- 
ness threshold after a large mainshock, see eq. 12). This 
decay is independent of the mainshock magnitude m for 
2 < m < 7.5. We also found that the magnitude of trig- 
gered aftershocks follows the Gutenberg-Richter law with 
b — 1 and is independent of the mainshock magnitude. The 
rate of triggered quakes increases with mas~ 10 am , with 
an exponent a = 1.05 ± 0.05. 

We have measured the fractal correlation dimension D for 
two different catalogs of relocated earthquakes in Southern 
California, given by the exponent of the cumulative distri- 
bution of the distances between all pairs of hypocenters. 
This fractal dimension D Ri 1.6, measured in a limited time- 
window, characterizes the spatial clustering of earthquakes 
due to fault network structure and earthquake interactions. 
By using only earthquake pairs with inter-event times larger 
than a threshold to compute D, we have tried separating the 
effects that fault geometry and earthquake triggering have 
upon spatial clustering. For inter-event times larger than 
1000 days, we obtain D » 2. Thus, our result a « D/2 
supports the assumption that earthquakes are triggered by 
static stress from past events. 

The fact that a is nearly equal to the 6-value has im- 
portant consequences for earthquake triggering. It means 
that equal magnitude bands contribute equally to stress 
at the location of a future hypocenter, e.g., earthquakes 
with 2 < m < 5 have collectively the same importance as 
5 < m < 8 events for stress changes and for triggering, be- 
cause the frequency of small earthquakes compensates for 
their lower triggering potential. Even if explicit stress cal- 
culations based on cataloged earthquakes have limited ac- 
curacy, we can estimate how small earthquakes affect earth- 
quake triggering. This can be done with a spatial resolution 
of a few km, using an ETES model or the more physical 
rate-and-state model. 
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Table 1. Scaling of aftershock productivity and earthquake 
frequency with magnitude. Times are in days and distances in 
km. Parameters Am, Dp, Da, Nl p, Nl^a and Tp are used 
in the selection of mainshocks and aftershocks (see section 3.3). 
Parameters t m i n , i m ax and A m i n define the time interval used 
to estimate the scaling of the number of aftershocks with the 
mainshock magnitude (see section 3.4). N is the total number 
of events with m > m^ and N m s is the number of mainshocks. 
Kq defined by (13) measures aftershock productivity. 
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m d 


time 
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Dp 
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Nl,f 
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Figure 1. (a) Average rate X(mM,t) of m > 3 earth- 
quakes as a function of the time t after the triggering 
earthquake, for different values of the magnitude tum of 
the triggering earthquake increasing from 2 to 7 with a 
step of 0.5 from bottom to top. The roll-over at short 
times is due to the incompleteness of the catalog fol- 
lowing a strong mainshock. Black lines show the fit of 
\(niM,t) by K(mM)/t 9 in the time interval [i m i n (tom), 
tmax(mM)], where i m in(niA:f) = 0.0003 day (26 sec) for 
rriM < 4.5 mainshocks. For larger mainshocks t m i n (m,M) 
is estimated using expression (12) as the time when the 
catalog is complete for m > 3. The maximum time is ei- 
ther 10 days or the time when the seismicity rate decays 
below A m in = 0.01 day -1 , (b) Aftershock productivity 
K(rriM) as a function of tum (circles) and cumulative 
magnitude distribution P(m) (crosses). Solid lines are 
linear regressions of K(itim) and P(m) with exponent 
respectively equal to a = 1.07 and b = 0.94. 
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Figure 2. Same as Figure 1 except that we have used 
rrid = 2 and we have corrected the seismicity rate for 
missing early aftershocks using (12) (assuming GR law 
with 6=1). We fit the seismicity rate in the time interval 
0.002 < t < 10 days and for X(t,m M ) > 0.1 day" 1 . The 
fit of K{m M ) give K = 0.0041 day p_1 and a = 1.05. 




time after mainshock (days) 

Figure 3. Same as Figure 2 with different parameters 
D A = 0.5 km, N L>A = 2, Dp ~ 3 km, N L , F = 3, and 
without including aftershocks in the influence zone of pre- 
vious aftershocks. 
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Figure 4. Cumulative magnitude distribution of trig- 
gered earthquakes, for mainshock magnitudes ranging 
from 2 (below) to 7 (top) with a bin size of 0.5. The 
curves have been normalized by a factor lO°' 5mM for clar- 
ity. Dotted lines represent a GR law with b — 1 for 
reference. The number of events decreases with mm (be- 
tween N = 11502 for 7 < m M < 7.5 down to N = 1328 
for 7 < m,M < 2.5) due to the smaller fraction of small 
earthquakes considered as mainshocks, and due to the 
shorter time window used for aftershock selection of small 
mainshocks. 
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Figure 5. Distribution of distances between hypocenters 
p r (r,t) for SHLK (a) and HCS (b) catalogs, using only 
earthquake pairs with inter-event times in the range [t, 
1.254], where t increases between 1.4 minutes (blue curve) 
to 2500 days (red curve). The fractal dimension of p r (r), 
measured for 0.1 < r < 5 km for all earthquakes (black 
lines), is D = 1.54 for SHLK catalog and D = 1.73 for 
HCS catalog. 
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Figure 6. Fractal dimension of p r (r,t) as a function of 
inter-event time t, using SHLK catalog (diamonds) and 
HCS catalog (circles). 



